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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/938, 548 A 



DATE: 07/24/98 
TIME: 12:44:08 



INPUT SET; S27673.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 
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SEQUENCE LISTING 



(1) 



General Information 



ENTERED 



(i) APPLICANT: Yanagisawa, Masashi 
Bergsma, Derk 
Wilson, Shelagh 
Brooks, David 
Gellai, Miklos 

(ii) TITLE OF THE INVENTION : NOVEL LIGANDS OF THE NEUROPEPTIDE 

RECEPTOR HFGAN72 

/ 

(iii) NUMBER OF SEQUENCES : -21 — 

(iv) CORRESPONDENCE ADDRESS: 



(A 
(B 
(C 
(D 
(E 
(F 



(v) COMPUTER READABLE FORM: 



(A 
(B 
(C 
(D 

(vi) 
(A 
(B 
(C 

(vii) PRIOR 
(A 
(B 

(vii 
(A 
(B 

(A 
(B 



ADDRESSEE: SmithKline Beecham Corporation 
STREET: 709 Swedeland Road 
CITY: King of Prussia 
STATE : PA 

COUNTRY: United States of America 
ZIP: 19406 



MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2,0 

CURRENT APPLICATION DATA: 
APPLICATION NUMBER: 08/938,548 
FILING DATE: 26-SEPT-1997 
CLASSIFICATION: 

APPLICATION DATA: 
APPLICATION NUMBER: 08/887,382 
FILING DATE: 2-JUL-1997 

PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 08/820,519 
FILING DATE: 19-MAR-1997 

APPLICATION NUMBER: 60/033,604 
FILING DATE: 17-DEC-1997 



(viii) ATTORNEY/ AGENT INFORMATION: 




PAGE: 2 RAW SEQUENCE LISTING DATE: 07/24/98 

PATENT APPLICATION US/08/938,548A TIME: 12:44:10 

INPUT SET: S27673.raw 

47 (A) NAME : King, William T. 

48 (B) REGISTRATION NUMBER: 30,954 

49 (C) REFERENCE/ DOCKET NUMBER: ATG50037-2 
50 

51 (ix) TELECOMMUNICATION INFORMATION: 

52 (A) TELEPHONE: 610-270-5219 

53 (B) TELEFAX: 610-270-4026 

54 (C) TELEX: 
55 

56 (2) INFORMATION FOR SEQ ID NO:l: 

57 

58 (i) SEQUENCE CHARACTERISTICS: 

59 (A) LENGTH: 1970 base pairs 

60 (B) TYPE: nucleic acid 

61 (C) STRANDEDNESS: single 

62 (D) TOPOLOGY: linear 
63 

64 (ii) MOLECULE TYPE: Genomic DNA 

65 

66 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

67 

68 AAAACATAAT GTGGGTCTCG CGTCTGCCTC TCTCCCGCCC CTAATTAGCA GCTGCCTCCC 60 

6 9 TCCATATTGT CCCAGGCCAG CGCTTCTTTT GTGCTCCCAG ATTCCTGGGT GCAAGGTGGC 120 

70 CTCATTAGTG CCCGGAGACC GCCCCATCTC CAGGGAGCAG ATAGACAGAC AAGGGGGTGA 180 

71 TCAGGGGCAC AGTGATCCAA CCCTGGCCTC TGAACGCCGC AGCGGCCATT CCTTGGGCCC 240 

72 AGCCTGGAGA CGGCCCCCCT GCAGCAGGCT AATCTTAGAC TTGCCTTTGT CTGGCCTGGG 300 

7 3 TGTGGACGCA ATGTGCCTGT CAATTCCCCG CCACCTCAGA GCACTATAAA CCCCAGACCC 360 

74 CTGGGAGTGG GTCACAATTG ACAGCCTCAA GGTTCCTGGC TTTTTGAACC ACCACAGACA 420 

75 TCTCCTTTCC CGGCTACCCC ACCCTGAGCG CCAGACACCA TGAACCTTCC TTCCACAAAG 480 

76 GTAAAGATCC AGGGATGGAG GGGTGACTCA GCCATCCCAG AGGAAGCAAA AAGAGTGCTT 540 

77 GCTCAGAGGG CTGGAAGAAA GGCCAAAGGT GTCTCCACTC TTGGTCTTTT CCTGGGTGTG 600 

78 CTCTGAGGCA GGAGCACCTG CCTTGGCTCA CATTGGGTTG GGTGCTGTTT TGCTAAGAGC 660 
7 9 CTGTGTTTGC TGAGCTCATA TGTGTCAGGT GCTCCGTTTG CACCTGTCAT CTCTTGTCAT 720 

80 CCTCCCAACA GCCTTGCAGA GTAGAAATTA TTTCTAGTAT ACCCAGTTTA CAGGTAAGGG 780 

81 AGCTGTGCCC TCTGAAAGGG CAGGAAACTG GTTCAAAGCA ACGGAGTTCA GTCACTCCTG 84 0 

82 CAAGGGGGCA GGCAGATGAG AGAGCATTCT GGAGTCTTGC TAGTTCCTGA TTTCCATGTG 900 

83 TTTCCCTGCT GTGGAGAGGA AGTTGGGGGG ACTCAGTAGG GCCCGGGTTT TTCCCAAGTT 960 

84 TACAACTTCT GCTGCAGACA GACACTCCTG TTTTCAGGTG GAGTGGCAAG TGCCCTAGTG 1020 

85 GTGGCAACAG TGGCCTAAGT CTCCAGAGAA AAGGGGGATT CACTCTGCCC AGGGGGTCTC 1080 

86 AAAAGGCTTC CTGTGGGAGA TGCTCTGCTG GGTCTTGAAG GAGGAGCAGG GAAAGTAGGC 114 0 

87 CGATACCAGC AAGGGCGCAA AGCAAGGAGA ACTAAGTGAC AGCCAGAAAG GAGTGCAGGC 1200 

88 TTGGAGGGGG CGCGGAGCCA GAGGGGCAGG TCCTGTGCGT GGGAGCTGGT GGCGGGCGCC 1260 

89 GTGGGAAGAC CCCCCCAGCG CCCTGTCTCC GTCTCCCTAG GTCTCCTGGG CCGCCGTGAC 1320 

90 GCTACTGCTG CTGCTGCTGC TGCTGCCGCC CGCGCTGTTG TCGTCCGGGG CGGCTGCACA 1380 

91 GCCCCTGCCC GACTGCTGTC GTCAAAAGAC TTGCTCTTGC CGCCTCTACG AGCTGCTGCA 1440 

92 CGGCGCGGGC AATCACGCGG CCGGCATCCT CACGCTGGGC AAGCGGAGGT CCGGGCCCCC 1500 

93 GGGCCTCCAG GGTCGGCTGC AGCGCCTCCT GCAGGCCAGC GGCAACCACG CCGCGGGCAT 1560 

94 CCTGACCATG GGCCGCCGCG CAGGCGCAGA GCCAGCGCCG CGCCCCTGCC TCGGGCGCCG 1620 

95 CTGTTCCGCC CCGGCCGCCG CCTCCGTCGC GCCCGGAGGA CAGTCCGGGA TCTGAGTCGT 1680 

96 TCTTCGGGCC CTGTCCTGGC CCAGGCCTCT GCCCTCTGCC CACCCAGCGT CAGCCCCCAG 1740 

97 AAAAAAGGCA ATAAAGACGA GTCTCCATTC GTGTGACTGG TCTCTGTTCC TGTGCGGTCG 1800 

98 CGTCCTGCCC ATCCGGGGTG GCAAAGCGTC TTGCGGAGGA CAGCTGGGCC TGGAAGCCCG 1860 
9 9 GCTGTCGGGC ACCAGCCTTA GCTTTTGCGT GGTTGAATCG GAAACACTCT TGGTTGGGGA 1920 



PAGE: 3 RAW SEQUENCE LISTING DATE: 07/24/98 

PATENT APPLICATION US/08/938, 548 A TIME: 12:44:1 1 

INPUT SET: S27673.raw 

100 GTTCCCAGTG CAAGGCCCTG GGGCACAGAG AGAACTGCAC AGGTGCATGC 1970 
101 

102 (2) INFORMATION FOR SEQ ID NO: 2: 

103 

104 (i) SEQUENCE CHARACTERISTICS: 

105 (A) LENGTH: 131 amino acids 

106 (B) TYPE: amino acid 

107 (C) STRANDEDNESS: single 

108 (D) TOPOLOGY: linear 
109 

110 (ii) MOLECULE TYPE: protein 

111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 
126 
127 
128 
129 
130 
131 
132 

133 (2) INFORMATION FOR SEQ ID NO: 3: 

134 

135 (i) SEQUENCE CHARACTERISTICS: 

136 (A) LENGTH: 33 amino acids 

137 (B) TYPE: amino acid 

138 (C) STRANDEDNESS: single 

139 (D) TOPOLOGY: linear 
140 

141 (ii) MOLECULE TYPE: protein 

142 

14 3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

144 

145 Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr Cys Ser Cys Arg Leu 

146 1 5 10 15 

147 Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie Leu Thr 

148 20 25 30 

149 Leu 
150 

151 (2) INFORMATION FOR SEQ ID NO: 4: 
152 





(Xi) ! 


SEQUENCE 


DESCRIPTION: 


SEQ ID 


NO: 2 


> * 






Met 


Asn 


Leu 


Pro 


Ser 


Thr 


Lys 


Val 


Ser 


Trp 


Ala 


Ala 


Val 


Thr Leu Leu 
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Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Pro 


Pro 


Ala 


Leu 


Leu 


Ser 


Ser 


Gly Ala Ala 
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Ala 


Gin 


Pro 


Leu 


Pro 


Asp 


Cys 


Cys 


Arg 


Gin 


Lys 


Thr 


Cys 


Ser Cys Arg 
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Leu 


Tyr 


Glu 


Leu 


Leu 


His 


Gly 


Ala 


Gly 


Asn 


His 


Ala 


Ala 


Gly He Leu 
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Thr 


Leu 


Gly 


Lys 


Arg 


Arg 


Ser 


Gly Pro 


Pro 


Gly 


Leu 


Gin 


Gly Arg Leu 
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Gin 


Arg 


Leu 


Leu 


Gin 


Ala 


Ser 


Gly 


Asn 


His 


Ala 


Ala 


Gly 


He Leu Thr 
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Met 


Gly 


Arg 


Arg 


Ala 


Gly 


Ala 


Glu 


Pro 


Ala 


Pro 


Arg 


Pro 


Cys Leu Gly 
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Arg 


Arg 


Cys 


Ser 


Ala 


Pro 


Ala 


Ala 


Ala 


Ser 


Val 


Ala 


Pro 


Gly Gly Gin 
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Gly 


He 
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PAGE: 4 RAW SEQUENCE LISTING DATE: 07/24/98 

PATENT APPLICATION US/08/938, 548A TIME: 12:44:13 

INPUT SET: S27673.raw 

15 3 (i) SEQUENCE CHARACTERISTICS : 

154 (A) LENGTH: 28 amino acids 

155 (B) TYPE: amino acid 

15 6 (C) STRANDEDNESS: single 

157 (D) TOPOLOGY: linear 
158 

159 (ii) MOLECULE TYPE: protein 
160 

161 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
162 

163 Arg Ser Gly Pro Pro Gly Leu Gin Gly Arg Leu Gin Arg Leu Leu Gin 

164 15 10 15 

165 Ala Ser Gly Asn His Ala Ala Gly He Leu Thr Met 

166 20 25 
167 

168 (2) INFORMATION FOR SEQ ID NO: 5: 
169 

170 (i) SEQUENCE CHARACTERISTICS: 

171 (A) LENGTH: 585 base pairs 

172 (B) TYPE: nucleic acid 

173 (C) STRANDEDNESS: single 

174 (D) TOPOLOGY: linear 
175 

176 (ii) MOLECULE TYPE: cDNA 
177 

17 8 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
179 

180 GGCTCGGCGG CCTCAGACTC CTTGGGTATT TGGACCACTG CACCGAAGAT ACCATCTCTC 60 

181 CGGATTGCCT CTCCCTGAGC TCCAGACACC ATGAACCTTC CTTCTACAAA GGTTCCCTGG 120 

182 GCCGCCGTGA CGCTGCTGCT GCTGCTACTG CTGCCGCCGG CGCTGCTGTC GCTTGGGGTG 180 

183 GACGCGCAGC CTCTGCCCGA CTGCTGTCGC CAGAAGACGT GTTCCTGCCG TCTCTACGAA 240 

184 CTGTTGCACG GAGCTGGCAA CCACGCCGCG GGCATCCTCA CTCTGGGAAA GCGGCGACCT 300 

185 GGACCCCCAG GCCTCCAAGG ACGGCTGCAG CGCCTCCTTC AGGCCAACGG TAACCACGCA 360 

186 GCTGGCATCC TGACCATGGG CCGCCGCGCA GGCGCAGAGC TAGAGCCATA TCCCTGCCCT 420 

187 GGTCGCCGCT GTCCGACTGC AACCGCCACC GCTTTAGCGC CCCGGGGCGG ATCCAGAGTC 480 

188 TGAACCCGTC TTCTATCCCT GTCCTAGTCC TAACTTTCCC CTCTCCTCGC CGGTCCCTAG 540 

189 GCAATAAAGA CGTTTCTCTG CTAAAAAAAA AAAAAAAAAA AAAAA 585 
190 

191 (2) INFORMATION FOR SEQ ID NO: 6: 
192 

193 (i) SEQUENCE CHARACTERISTICS: 

194 (A) LENGTH: 130 amino acids 

195 (B) TYPE: amino acid 

196 (C) STRANDEDNESS: single 

197 (D) TOPOLOGY: linear 
198 

199 (ii) MOLECULE TYPE: protein 
200 

201 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
202 

203 Met Asn Leu Pro Ser Thr Lys Val Pro Trp Ala Ala Val Thr Leu Leu 

204 15 10 15 

205 Leu Leu Leu Leu Leu Pro Pro Ala Leu Leu Ser Leu Gly Val Asp Ala 
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229 
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232 
233 
234 
235 
236 
237 
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RAW SEQUENCE LISTING DATE: 07/24/98 

PATENT APPLICATION US/08/938,548A TIME: 12:44:15 

INPUT SET: S27673.ruw 
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Gin Pro 


Leu 


Pro 


Asp 


Cys 


Cys 


Arg 


Gin 


Lys 


Thr 


Cys 


Ser 


Cys Arg 


Leu 
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Tyr Glu 


Leu 


Leu 


His 


Gly Ala 


Gly 


Asn 


His 


Ala 


Ala 


Gly 


He Leu 


Thr 
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Leu Gly 


Lys 


Arg 


Arg 


Pro 


Gly 


Pro 


Pro 


Gly 


Leu 


Gin 


Gly Arg Leu 


Gin 
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Arg Leu 


Leu 


Gin 


Ala 


Asn 


Gly 


Asn 


His 


Ala 


Ala 


Gly 


He 


Leu Thr 


Met 
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Gly Arg 


Arg 


Ala 


Gly 


Ala 


Glu 


Leu 


Glu 


Pro 


Tyr 


Pro 


Cys 


Pro Gly Arg 
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Arg Cys 


Pro 


Thr 


Ala 


Thr 


Ala 


Thr 


Ala 


Leu 


Ala 


Pro 


Arg 


Gly Gly 


Ser 



115 120 125 



Arg Val 
130 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met Asn Leu Pro Ser Thr Lys Val Pro Trp Ala Ala Val Thr Leu Leu 

1 5 10 15 

Leu Leu Leu Leu Leu Pro Pro Ala Leu Leu Ser Leu Gly Val Asp Ala 
20 25 30 



(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr Cys Ser Cys Arg Leu 

15 10 15 

Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie Leu Thr 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO : 9 : 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/938,548A 



DATE: 07/24/98 
TIME: 12:44:17 



INPUT SET: S27673.raw 



Original Text 



